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ABSTRACT 

We study the impact of catastrophic errors occurring in the photometric red- 
shifts of galaxies on cosmological parameter estimates with cosmic shear tomog- 
raphy. We consider a fiducial survey with 9-filter set and perform photo-z mea- 
surement simulations. It is found that a fraction of 1% galaxies at Zspec ~ 0.4 is 
misidentified to be at 2;phot ~ 3.5. We then employ both fitting method and 
the extension of Fisher matrix formalism to evaluate the bias on the equation of 
state parameters of dark energy, wq and Wa-, induced by those catastrophic out- 
liers. By comparing the results from both methods, we verify that the estimation 
of Wo and Wa from the fiducial 5-bin tomographic analyses can be significantly 
biased. To minimize the impact of this bias, two strategies can be followed: 
(A) the cosmic shear analysis is restricted to 0.5 < z < 2.5 where catastrophic 
redshift errors are expected to be insignificant; (B) a spectroscopic survey is con- 
ducted for galaxies with 3 < Zphot < 4. We find that the number of spectroscopic 
redshifts needed scales as Agpec oc /cata x ^ where /cata = 1% is the fraction of 
catastrophic redshift errors (assuming a 9-filter photometric survey) and A is the 
survey area. For A = 1000 deg^ we find that A'spec > 320 and 860 respectively 
in order to reduce the joint bias in {wq, Wa) to be smaller than 2a and Icr. This 
spectroscopic survey (option B) will improve the Figure of Merit of option A by 
a factor xl.5 thus making such a survey strongly desirable. 

Subject headings: cosmology-gravitational lensing, large scale structure of uni- 
verse 
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1. Introduction 

The weak gravitational lensing effect by large scale structures, namely the cosmic shear, 
is beheved to be a powerful tool to measure the cosmological parameters including the equa- 
tion of state of dark energy. Over the last ten years, weak lensing observations have already 
provided increasingly better constraints on the matter density VLm and the amplitude of 
mass fluctuations often represented by erg, the rms amplitude of the smoothed and linearly 
extrapolated mass density perturbations on the scale of 8 Mpc h'^ (Hoekstra, Yee & Glad- 
ders 2002, Jarvis et al. 2003, Rhodes et al. 2004, Heymans et al. 2004, Hoekstra et al. 

2006, Semboloni et al. 2006, Massey et al. 2007, Benjamin et al. 2007, Fu et al. 2008). 
Their contributions to the constraints on the total neutrino mass have also been investigated 
(Tereno et al. 2008, Gong et al. 2008, Ichiki et al. 2008, Li et al. 2008). Several more 
ambitious weak lensing surveys targeting at hundred to thousand times larger areas, such as 
DEdll, LSStI, JDEmI and EUCLIE0, are being planned aiming at exploring the properties 
of dark energy in great detail. 

Future large surveys are expected to be able to control statistical errors to insignificant 
levels. Therefore different systematics can become dominant sources of contaminations in 
future weak lensing studies, and should be investigated carefully. These include the effect 
of the point spread function (PSF) (e.g., Amara & Refregier 2008), the methodology in the 
shear measurement (Bacon et al. 2001, Erben et al. 2001, Hirata & Seljak 2003, Heymans 
et al. 2006, Massey et al. 2007, Miller et al. 2007, Kitching et al. 2008, Bridle et al. 2008, 
Ngan et al. 2008, Semboloni et al. 2008), redshift errors (Ma et al. 2006, Huterer et al. 
2006), intrinsic alignments of source galaxies (see Heavens 2001 for a review. Bridle & King 

2007, Fan 2007), uncertainties in the computation of the non-linear power spectrum (see 
discussion in Huterer 2002, Van Waerbeke et al. 2002), and so on. In this paper, we focus 
on the impact of the photometric redshift errors on cosmological studies. 

The necessity of measuring vast amount of galaxy redshifts in cosmic shear observations 
makes the multi-color photometric determination of redshifts the only feasible way in ob- 
taining the redshift information of source galaxies. However the inaccuracy of photometric 
redshifts can contaminate considerably the cosmological information embedded in cosmic 
shears (Huterer et al. 2006; Ma et al. 2006; Amara & Refregier 2007; Wittman et al. 2007; 

^http: / / www.darkenergysurvey.org 
^http://www. lsst.org 
■^littp:/ /jdem. gsfc.nasa.gov 

"'http: / / sci.esa.int / science-e / www / object / index. cfni?fobjectid=42266 
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Abdalla et al. 2008; Margoniner & Wittman 2008; Stabenau et al. 2008). Taking into 
account the bias and the scatter of Zpi^ot with respect to 2;spcc! Ma, Hu & Huterer (2006) 
analyze the corresponding degradation of the constraints on dark energy. It is found that 
these errors can completely erase the additional information from tomographic lensing ob- 
servations. In order to make full use of the advantage of the tomographic binning, both the 
bias and scatter of 2;phot have to be known to better than 0.003-0.01. Amara & Refregier 
(2007) and Abdalla et al. (2008) pay special attention to the so-called catastrophic errors in 
Zphot, the outliers with j^^gpec — -^photl > a-nd explore their effects on the figure of merit of 
dark energy parameter constraints. 

While most of the existing studies concentrate on the error range of parameter con- 
straints, the systematic bias in the determination of cosmological parameters due to the 
photometric redshift errors should also be carefully analyzed. A large systematic bias can 
lead to a completely wrong conclusion on the values of the cosmological parameters al- 
though the statistical errors may be small (Amara & Refregier 2008). Our study presented 
in this paper emphasizes precisely this aspect of effects on dark energy parameter estimates 
arising from catastrophic failures in photo-z measurements. We focus on a survey with 
similar design as described in Aldering et al. (2004), with its 9-filter set described also in 
Dahlen et al. (2008) and Jouvel et al. (2009 in prep.). We analyze in particular the bias 
on dark energy parameter estimates induced by the catastrophic failures occurred around 
(zspcc, -Zphot) ~ (0.4,3.5). And we investigate in detail the requirements of the number of 
spectroscopic redshift measurements to retrieve the correct estimates. 

The outline of the paper is as follows. In §2, we introduce the methods to describe the 
lensing observables and to evaluate the biases. We present the results of bias in §3. In §4, 
we discuss the calibration requirements to remove the biases due to the catastrophic failures. 
Discussions and conclusions are given in §5. 

2. Methodology 

In this section, we describe the formalism of lensing observables. Based on simulation 
results, we then model the source galaxy distribution with catastrophic failures taken into 
account. Finally we discuss the methods to evaluate the corresponding biases on dark energy 
parameter estimates, without and with spectroscopic calibration. 
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2.1. Lensing Observables 

The convergence power spectrum -Pj^(^) for the ith and jth redshift bin can be expressed 
as (see Kaiser 1992, Kaiser 1998, Ma et al. 2006) 

/■°° H(z) 
m) = l dzW,{z)W,{z)^^P{ke,z), (1) 

where H{z) is the Hubble parameter and D{z) is the comoving angular diameter distance. 

The matter power spectrum P{ke, z) with ki = £/D(z) is computed from the linear transfer 
function of Bardeen et al. (1986), and then the non-linear fitting function of Peacock & 
Dodds (1996). The weighting function Wi is given by 



X 



,, Dls{z,z') 



where Dls{z-i z') is the angular diameter distance between z and z' , and rij is the galaxy 
distribution (normalized to unity) of the ith redshift bin. We employ the overall galaxy 
distribution of the form (Small et al. 1994) 

n{z) oc exp [-(1.41;^/;^^ed)T (3) 
where 2;ined is the median redshift of a survey and we take a — 2, /3 = 1.5. 

Considering only the shot noise and the Gaussian sample variance, the covariance matrix 
of the lensing observables can be written as (e.g., Huterer et al. 2006, Ma et al. 2006) 

Gov [C5(f ), CM] = ^^^^["^'f^^^^ [Ctum^m + CmC-M ■ (4) 

where is the band width in multipole £, and /sky is the fractional sky coverage of a survey. 
The total power spectrum C^j is given by 

CS = ^ + ^.4, (5) 

"'9 

where cr^ is the variance of each component of the intrinsic ellipticity of source galaxies, and 
is the surface number density of galaxies in the ith bin. 

In our study, we consider a fiducial space-based survey with /sky = 0.025, corresponding 
to 1000 deg^, the summation of n^, i.e., the total surface number density of source galax- 
ies is taken to be 100 arcmin"^, (7^=0.22 and 2;nied=l-26. We are aware of the likely 
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extension of the survey area. Thus in analyzing the requirement of the spectroscopic red- 
shift cahbration, we derive a scahng relation that allows us to estimate the corresponding 
calibration requirement for any survey area and any catastrophic error fraction. 

Investigations by Ma et al. (2006) show that tomographic analyses with multi-bins of 
source galaxies (denoted as A^bin) can improve the statistical errors on cosmological param- 
eter constraints considerably in comparison with those of 2-D analyses. The improvement 
increases with the increase of A^bin- On the other hand, we cannot gain much further im- 
provement with iVbin > 5. Hence throughout the paper, we consider lensing tomography 
with 5 equally spaced redshift bins within the range of z = [0, 3]. All galaxies with z > 3 
are simply added to the last bin. The resulting redshift binning is illustrated in Figure [2j 



2.2. Modeling galaxy distribution with catastrophic failures 

Photometric estimations of galaxy redshifts through multi-waveband observations largely 
depend on the characteristic features of the spectral energy distribution (SED) of galaxies, 
such as the Lyman break and Balmer/4000A break. The locations of the features give us 
the redshift information of the galaxies. Therefore the accuracy of photometric redshifts is 
sensitive to the wavelength coverage of observations. It is also affected by the observational 
signal-to-noise ratio and the estimation method. Besides, the spectroscopic calibration plays 
an important role in improving the accuracy of photo-z, which shall be discussed specifically 
in §4. 

To realistically assess the accuracy of photometric redshifts for our survey, we first 
consider a 9-filter set which covers a wavelength range from visible blue to infrared with 
A ~ 390 nm to ~ 1700 nm (For specific definition, see Dahlen et al. 2008). Then a series of 
simulations are carried out . The details of the simulations will be presented in Jouvel et al. 
(2009 in prep.). Here we give a brief description. The simulations are performed with the 'Le 
Phare' software. First we generate galaxies according to the GOODS luminosity function 
by Dahlen et al. (2005), each with an assigned spectrum based on the Coleman Extended 
templates (Coleman et al. 1980). This set of templates contains 66 spectra obtained by 
interpolating between 5 standard spectra: elliptical, Sbc, Scd, irregular and star forming 
galaxies. We then apply the filtering process to the simulated spectra with the 9-filter set. 
We thus obtain the SED for each simulated galaxy. The magnitudes of galaxies are randomly 
assigned in accordance with the properties of our fiducial survey. The photometric redshift 
for each galaxy is then computed using the LePhare photometric redshift code. The results 
are shown in Figure [H 
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It is seen that scatters and biases around Zgpec exist for Zphot- Besides, there are notable 
islands in the ^spec-^^phot plot. The dominant one is located at z^pcc ~ 0.4 and Zphot ~ 3.5, 
which accounts for more than 50% of the galaxies with |zphot — -^spccl > 1- This island 
obviously comes from the confusion of the 4000A break with the Lyman break in galaxy 
SEDs due to the limited wavelength coverage. It is found that the fraction of galaxies with 
such large catastrophic errors is about 1% of the total number of galaxies. In our study, we 
particularly analyze the effect of this island on the constraints of dark energy parameters. 

To characterize the true distribution of galaxies whose photometric redshifts suffer from 
catastrophic failures, we use the following Gaussian distribution with the form 



^cata GXp 



(6) 



2a2 

where the 'true' mean redshift of the island Zm = 0.398 and the standard deviation a = 0.108 
are obtained from fitting to the spectral(true)-z distribution histogram of the island. The 
parameter Acata is determined by normalizing the Gaussian distribution to the overall fraction 
of catastrophic failures /cata = 0.01. Then in a tomographic division according to galaxies' 
photometric redshifts, this part will be mistakenly shifted from low-z bin to another high-z 
bin. Specifically, for the A^bin = 5 tomography we consider, the 1% galaxies with their true-z 
falling in the first bin are assigned to the last (5th) bin whose redshift range is Zphot > 2.4. 



2.3. Evaluation of biases on dark energy parameter estimates 

Once the distribution of catastrophic failure fraction is known, we can evaluate the 
biases it induces on dark energy parameter estimates. 

As mentioned in the previous subsection, we need to be aware of the fact that obser- 
vationally, source galaxies are assigned to different redshift bins in terms of their 'observed' 
photometric redshifts rather than their true redshifts. Hence, a fraction of low redshift galax- 
ies can be mistakenly distributed to high-redshift bins due to the catastrophic errors. As a 
consequence, weak lensing signals from those relevant tomographic bins are perturbed and 
biases on cosmological parameter estimates may arise, if models used to fit the data were 
not modified correspondingly. 

Concerning the fiducial survey in this study, we know from simulations that there will 
be /cata = 0.01 galaxies being misplaced into the 5th bin (denoted as 'Bin-5'), whilst their 
true redshifts are within the range of the first bin (denoted as 'Bin-1'). The true distribution 
of this catastrophic failure part can be described with Eq. (j6]) and will be labeled as Bin-cata 
for convenience in the following discussion. Note that the true z-range of Bin-cata is within 
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the z-range of Bin-1. 

For Bin-1, when Bin-cata galaxies within it are misplaced in Bin-5 due to catastrophic 
redshift errors, the number of galaxies contained in Bin-1 is reduced. The redshift distribu- 
tion of the galaxies left in it also changes somewhat in comparison with the true distribution 
without the catastrophic errors. The number change in Bin-1 does not affect the lensing sig- 
nal because the galaxy distribution within the bin is always normalized to unity. The shape 
change of the redshift distribution in the bin may have some effects. On the other hand, 
however, this change may already be included in the modeled overall redshift distribution of 
galaxies derived from photometric redshifts. Thus in our consideration, we assume that the 
misplacement of Bin-cata does not affect the lensing signal from Bin-1. 

For bins in between Bin-1 and Bin-5, obviously the lensing signals should be kept un- 
perturbed. Combining the previous analysis for Bin-1, we thus get C^j{i) = C^j{i), where 
i,j = 1,...,4. 

To the contrary, the lensing signals from Bin-5 get perturbed. As shown in the bottom- 
left panel of Figure [31 it in fact consists of two parts of galaxies, one with their true redshifts 
falling in Bin-5 and the other being a contamination from Bin-cata. Correspondingly, besides 
the auto spectra of Bin-5, those cross spectra with one of the bins being Bin-5 will also be 
affected. In this case, those convergence power spectra can be expressed as 

r°° H(z) 
PtS) = J^ dzW.{z)W,{z)^^P{h,z), (i = l,...,5) (7) 

Dls{z,z') 



with 

W^iz) = ■ ■ I dz'n^iz', /cata, z^, cr) 



D{z') 

where we omit the antecedent terms identical to those in Eq. ([2]). The true (spectral) galaxy 
distribution of Bin-5 can be written as 

n5{z, /cata, Zm, (t) = ^ [n5{z) + nci,ta.{z, /cata, Zm, Cx)] 

/S ~r /cata 



1 



/5 + /c 



cata 



Z Zrt 



\2 



{725(2;) + Acataexp — } (9) 



where l/(/5 + /cata) is to normalize n^{z, fc^t^,Zm,o) to be unity, with /s being the overall 
fraction of galaxies with their true redshifts falling in Bin-5. Here the right side terms n^i^z^ 
and ncata are normalized to their overall fraction /s and /cata, respectively. Acata is thus 
determined by /cata, as mentioned in the paragraph following Eq. ([6]). With the substitution 
ng — i> + rf^^^ in Eq. ([5]), finally we get the 'observed' total spectra Cj^(£, /cata, -^m, c"), 
where /cata = 0.01, Zm = 0.398, a = 0.108 and i = 1, 5. 
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In practical analysis, when calculating the 'theoretically expected' lensing signals for 
Bin-5, we have no knowledge, in the case without spectroscopic calibration, of the fraction 
of low redshift galaxies and regard all the galaxies being at high redshifts; or in the case 
with inadequate spectroscopic calibration, we have inaccurate knowledge therefore can make 
unprecise estimation of the fraction and distribution of low-z galaxies (see Figure [3]). The 
former can be denoted as C^^{£,0,c,c), with c being an arbitrary constant. The latter case 
gives C^^{i, fca,ta.,Zm,(^), where the estimations /cata,^m and a can deviate from their true 
values depending on the calibration size. 

Consequently systematic biases may arise when the theoretically expected signals C^j 
are compared with the observed ones C^j to constrain cosmological parameters. As an 
approximation, a simple extension of the Fisher matrix formalism can be used to compute 
the bias in cosmological parameters with (see Huterer et al. 2006, Amara & Refregier 2008) 

= ^ij' E [^-(^) - Gov-' [C^ii), C^ii)] (10) 

where a and P each denote a pair of redshift bins. The term [C^{i) — C^{i)] reflects the 
bias on the shear signals. The covariance of the cross power spectra is given in Eq. F^^ 
is the inverse of the Fisher matrix 




where C is the column matrix of the convergence power spectra and Gov is the inverse of 
the covariance matrix. 

Eq. ffTOj) is a linear approximation around the fiducial model. On the other hand, the 
true extent of bias can be quantified by a direct chi-square fitting method, by minimizing 



X' = Y1 - Cov-^ [C^ii), C^^ii)] [C^^ii) - C^^ii)] . (12) 

e,a,l3 

In our analyses, we will utilize both bias evaluation methods. The comparison of their 
results permits us to quantify the extent to which the linear approximation is valid. We shall 
come back to this in the following section, associated with the specific study for our fiducial 
survey. 

Throughout our study, a 7 parameter fiducial fiat cosmological model is assumed. For 
equation of state of dark energy, we consider the form w{z) = wq + Waz/{1 + z) (Chevallier 
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& Polarski 2001). The fiducial values of the 7 parameters are: flm = 0.27,1^0 = —l,Wa = 
0,h = 0.72, as = 0.78, Qb = 0.0446, n = 0.96, respectively. 

All the weak lensing related calculations are done with the ICOSMO package (Refregier, 
version 200^, with modifications from 2D weak lensing to tomographic form for our specific 
study. The fitting tool is MPFIT0 from C. Markwardt. 

3. The bias on dark energy parameter estimates 

To demonstrate clearly their significance, in this section, we present the results of bias 
on dark energy parameters due to catastrophic failures if there is no spectroscopic calibration 
for the problematic Bin-5. We apply both methods by fitting and by computation with the 
extension of Fisher matrix formalism. In the next section, we will focus on the requirements 
for the spectroscopic calibrations. 

First, we employ the fitting method. As described in the previous section, for Bin-5, 
the observed signal is a combination of the two parts illustrated in the bottom-left panel 
of Figure O On the other hand, the model calculation is done with the galaxy distribution 
illustrated in the top panel of Figure [31 The fitting is then applied to find the 'best- 
fit' values of the cosmo logical parameters by comparing the model-predicted lensing signals 
with the 'observational data'. In this fitting process, the 'measurement errors' for each 'data 
point' are theoretically calculated using a binned form of Eq. (jl]). 

In our calculations, the i range is taken to be [50,3000]. We discard weak lensing 
information from all those scales beyond £ = 3000 to avoid complications from various 
small-scale effects, such as baryonic cooling (White 2004, Zhan & Knox 2005, Jing et al. 
2006) and non-Gaussianity (White & Hu 2000, Cooray & Hu 2001). We apply binning 
procedures to i. According to Hu & White 2001, the bin width should be at least twice 
as big as ^fieid? in order to reduce the covariance between the bins arising from the survey 
geometry. We then take 10 bins, uniform in logarithmic scale over the range i = [50, 3000]. 

In the right panel of Figure HJ we show the resulting bias on the equation of state of 
dark energy obtained from fitting. Gaussian priors a{Qb) = 0.01 for Qf, and a^pi) = 0.05 
for others are applied upon the hidden parameters in our 7-parameter fiducial model. These 
priors are consistent with the current constraints. It is found that the 1% catastrophic 



^for a latest version, see|http://www.icosmo.org/| and Refregier et al. (2008) 
^http://cow. physics. wisc.edu/~craigm/idl/fitting. html 
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failure fraction can bring large biases on wq and Wa in the tomographic 5 z-bins case, highly 
significant compared to their statistical errors. The 'best fit' values are Wq = —3.5 and 
Wa = 3.8 in comparison with the fiducial ones Wq = —1 and Wa = 0. 

The results from the extended Fisher matrix formalism are shown in the right panel 
of Figure HI The same ^-binning process and Gaussian priors are applied. The 'best fit' 
values are found to be Wq = —4.2 and Wa = 6. Thus the linear approximation results in a 
notably larger bias than those of the fitting. This significant discrepancy reveals clearly 
the necessity to go beyond the linear approximation from the Fisher matrix formalism. 

In summary, our analyses show that for our fiducial survey with a 9- filter set, the 
catastrophic photo-z errors with /^ata ~ 1% can bias the dark energy parameters at a level 
far exceeding the statistical errors if no further spectroscopic calibration is conducted. 

Our calculations are done for a survey area A = 1000 deg^, corresponding to a sky 
coverage /sky = 0.025. Increasing the survey area reduces the statistical errors with a oc 
1 / a/ /sky On the other hand, the absolute biases on cosmological parameters should not 
change much. This can be seen clearly from Eq. (ITOil in which /sky enters through and 
Cov~^. Since both are proportional to f~^^, their dependencies on /sky cancel out. Thus with 
a larger sky coverage, the effect of the bias relative to the statistical errors would be even 
larger if the catastrophic fraction /^ata remains ~ 1%. 



4. Requirements of Spectroscopic Calibrations 

The large bias seen in Figure H] clearly reveals the necessity to take extra measures 
to reduce the effects of the catastrophic redshift errors. Among others, the spectroscopic 
calibration can play an crucial role in this regard. It is known that the spectroscopic cali- 
bration can be done either by taking spectra of a sub-sample of source galaxies after photo-z 
measurements or by employing a sophisticated technique where the spectral information 
is included in the determination of photometric redshifts. We consider the former case in 
our analyses, and explore the requirements of the spectroscopic calibration. Specifically, 
we study the needed number of galaxies with measured spectroscopic redshifts in order to 
retrieve the unbiased parameter estimates. 

Since we particularly analyze the 'island' catastrophic failures around (-Zspec, -Zphot) ~ 
(0.4,3.5) seen in Figured! we consider the calibration for galaxies with their photo-z in the 
range 3 < 2;phot < 4. We assume that our simulation results shown in Figured] reflect the real 
relation between photo-z and the true redshifts for observed galaxies. In other words, we 
take the simulation results as the flducial distribution with an overall /cata = !%• We then 
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investigate how many spectra are needed for the sub-sample of galaxies with 3 < 2;phot < 4 
in order to gain some information on the catastrophic contamination, mainly the value of 

/cata ■ 

Our analyzing procedures are as follows. We use the simulated galaxies with 3 < 
-Zphot < 4 as our parent fiducial sample. Then for a given Nspec, where A'spec is the number 
of galaxies chosen for spectral measurements, we perform a large number of Monte Carlo 
random samplings of Nspcc galaxies from the parent catalog. For each realization, we first 
compute the estimated fraction of catastrophic failures /cata by counting the number of 
galaxies among A'^gpec that are identified in the realization to be in Bin-cata, taking into 
account the overall fraction of galaxies that have 3 < 2;phot < 4 . With this sub-sample of 
Bin-cata galaxies, we then fit their distribution to the form given by Eq. to obtain the 
estimated Zm and a, denoted as and a. With these calibrated values of /cata, and a, 
we are able to calculate the model-predicted signals. By comparing them with those from 
the fiducial distribution, we therefore can evaluate the biases on cosmological parameters 
for each individual realization. The biases arise from the differences between the calibrated 
(/cata, Zm, ct) and the fiducial (/cata, Zm, 0"). With the results from all the realizations, we 
can further evaluate statistically the level of bias from A^spec calibrations. 

Figure [5] shows the results on wq and Wa of 100 realizations for A^^gpec = 100 (upper 
panels) and iVspec = 500 (lower panels), respectively. The symbols are the 'best fit' values 
from individual realizations, and the statistical errors are calculated at the fiducial values 
for the purpose of clarity. The left panels are for the results of full calculations including 
both the auto- and cross- correlations between different redshift bins. The right panels are 
the results considering only the auto-correlations within individual redshift bins. In each 
panel, the pluses and triangles correspond, respectively, to the results from fitting and 
from the Fisher matrix calculations of Eq. fllOj) . First when the bias is large and the fitted 
values are outside the 3a contour, the results from the linear approximation of Eq. (ITUl) 
start to deviate from the fitting results, as seen most evidently in the upper right panel. 
This shows the limitation of the linear approximation. If in some cases, very large biases are 
expected, it is necessary to perform a full fitting to evaluate quantitatively the biases. 
Otherwise, an over-estimate of the bias on wq and Wa would be obtained from the linear 
analysis. This is in good accordance with the results shown in Figure |H On the other hand, 
when the biases are within the 3a statistical contour, the two sets of results are in excellent 
agreement. Thus in this regime, the linear approximation of Eq. (fTOj) provides us an accurate 
and convenient way to estimate the bias effects. We further find that the triangles from the 
linear calculations are almost perfectly on a straight line with little scatters. The results from 
the full calculations follow the same line when the biases are relatively small, and are 
bent downward when the biases are large. Noting that the biases depend on three quantities 
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(/cata, Zra, 0"), the lining up of the results indicate that there is a dominant factor among the 
three. Our detailed analyses, presented later in Figure 6, show that the biases are mainly 
determined by (/cata — /cata), and the shape of the redshift distribution in Bin-cata (described 
by the parameters z^, o") has insignificant effects. 

The straight line behavior presents us a quantitative way to describe the joint bias of 
Wq and Wa- We define the joint bias of Wq and Wa for a realization, denoted by 6{wo,Wa), 
as the distance in the {wQ,Wa) plane, between the fiducial point and the best-fit point of 
this realization, with a sign equal to sign[wa(iit) — (fiducial)]. Furthermore, to quantify 
the relative bias with respect to the statistical errors, we define the corresponding joint 
statistical error a{wo,Wa) as the distance in the {wQ,Wa) plane between the fiducial point 
and the intersection point of the straight line with a given statistical error contour. This 
definition accounts correctly for the correlation between wq and Wa- 

To see the dependence of the bias on calibrations, in Figure M, we show the correlation 
of the joint bias S{wo, Wa) defined above and the residual /cata (corresponding to /cata — /cata) 
from each individual realization for A'gpec = 500. It clearly demonstrates that the bias scales 
approximately linearly with the fraction of catastrophic failures. The small dispersions of 
5{wo, Wa) at a fixed residual /cata correspond to results with different calibrated Zm and a for 
Bin-cata. This reveals quantitatively that the effect of the specific distribution within Bin- 
cata is sub-dominant, compared to the impact of the overall fraction of Bin-cata, namely /cata- 
This fact actually allows us to obtain an approximate scaling relation for the dependence of 
the bias on the survey area /sky and on the catastrophic fraction /cata? which will be discussed 
shortly. 

With the biases obtained from individual realizations, we can statistically quantify the 
typical level of bias for a given iVspec. From Figure O and El we see that the average bias 
from all the realizations of a given A^spec is expected to be close to zero. Hence we use the 
root-mean-square bias calculated from the bias distribution to characterize the bias level 
for that particular Nspec- To illustrate it more clearly, in Figure Ul we show the distri- 
bution of 6{wq, Wa) from 500 realizations for Nsp^c = 500. The bias for each individual 
realization is calculated by employing the Fisher matrix formalism Eq. flTI]]) since it can 
give good approximation when the residual /cata is reasonably small. It can be seen that 
the distribution is very close to a zero-mean Gaussian distribution. The rms bias is cal- 
culated to be (T[6{wo,Wa)] ~ 0.61 for Nspec = 500. We perform such a calculation on 
a[6{wo,Wa)] for each considered Nspec, and the results are shown in Figure [H We can 
see that a[S{wo,Wa)] oc 1/ A^spcc to a very good approximation, and can be written as 
a[6{wo,Wa)] = 0.61/ -\/Ai'spec/500. For /sky = 0.025 and /cata = 0.01, the la and 2a Wq and 
Wa joint statistical errors defined above are ~ 0.465 and 0.762, respectively. Then the needed 
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-^V^gpec > 860 and 320 so that the rms bias a[5{wo,Wa)] can be smaller than the la and 2a 
statistical errors, respectively. 

The specific results on the required A^spec shown above are calculated with /sky = 0.025 
and /cata = 1%- However, based on our analyses, we can derive a scaling relation which 
permits us to estimate the needed A^spec for different /sky and /cata- As illustrated in Figure 
El the bias is linearly proportional to /cata — /cata- Thus the rms bias a[6{wo,Wa)] should be 
approximately proportional to the rms of /cata - /cata, namely a(/cata - /cata)- Furthermore, 
for a set of realizations, the number of calibrating galaxies that are identified to be in 
Bin-cata follows the binomial distribution with a mean value of A^specP and the variance of 
NspecPi^ —p), where p = /cata X A^tot/^A's, with A^tot and AA^s representing the total number 
of galaxies and the number of galaxies with 3 < Zphot < 4, respectively. In our consideration, 
we have p ~ 0.15 << 1, and the distribution is close to the Poisson distribution. Then 
(^(/cata - /cata) oc V/cata/ a/A^spcc- Ou the other hand, as we discussed in §3, the absolute 
bias should not depend on the survey area. The statistical error a{wo, Wa) scales as 1/ ^/ fsky 
Therefore the relative bias with respect to the statistical error scales as a/ /cata x /sky/A'spec- 
This in turn gives rise to the scaling relation for the needed A^spec oc /cata x /sky- We then 
have 



for keeping the bias smaller than the la statistical error. Thus for /cata = 1% and /sky = 0.25 
{A = 10, 000 deg^), we would need A^spec > 8600. We emphasize that the number 860 in the 
scaling relation above, depends on the specific detail of the considered catastrophic errors 
and the overall galaxy redshift distribution n{z) [see Eq. f llOl) ]. Our analyses are particularly 
for the island located at (-^spec, -^phot) ~ (0.4,3.5), which is the dominant catastrophic error 
occurrence seen from our photo-z simulations. It is known that the pattern of catastrophic 
errors depends on specific photo-z measurements, notably on the wavelength coverage and 
the filter set used. Thus different sets of filters would result in different catastrophic errors, 
and consequently different calibration requirements. It is noted that the above linear scaling 
relation for /cata is an approximate one as the normalization (860 here) has a mild dependence 
on the fiducial value of /cata- It should also be pointed out that the calibration requirement 
we present does not contain the part needed internally for the photo-z measurements, which 
we assume have been done with considerable accuracy, prior to yielding the /cata = 1% 
catastrophic outliers. 




(13) 
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5. Summciry and Discussion 

In this paper, we concentrate on the island part of the galaxies in the (-Zspcc, -Zphot) 
plot where -Zphot is completely mis-estimated. This type of large catastrophic failures in the 
photometric redshift measurements arise from the confusion of different characteristics of 
galaxies' SED due to the hmited wavelength coverage. The most notable features used in the 
photo-z measurement of galaxies are the Lyman and 4000A breaks. The mis-identification 
of the two results in the islands at {zspec, -s^phot) ~ (0.4, 3.5) and (-Sgpec, -^phot) ~ (3, 0.2) seen in 
Figure 1. We particularly analyze the systematic bias induced by such islands on cosmological 
parameter estimations. We find that a fraction of /cata = 1% catastrophic failures due to 
galaxies at Zgpec ~ 0.4 being misidentified to be at -Zphot ~ 3.5, which is the dominant case 
of catastrophic failure occurrence in our photo-z simulations, can bring systematic biases on 
Wo and Wa at a level far exceeding the statistical errors in 5-tomographic-bin analysis. 

It has been realized that spectroscopic redshift calibration may be necessary for fu- 
ture weak lensing surveys in order to reduce the uncertainty of photometric redshift errors. 
The large bias shown in our analyses reinforces such an argument. We investigate the re- 
quirements of the spectroscopic calibration to retrieve the unbiased parameter estimates. 
We find that for /cata = 1%, and /sky = 0.025, about 400-1000 spectral redshift measure- 
ments for galaxies with Zphot = [3, 4] are needed. We further present a scaling relation of 
-^spec oc /cata X /sky Thus for a future survey with a survey area of 10, 000 deg^, the size 
of the calibration sample in the redshift range Zphot = [3, 4] should be on the order of 10^ 
if /cata remains to be ~ 1%. In analyzing the error degradation from general photometric 
errors, people have argued that the inaccuracy of the high redshift photometric measure- 
ments can be compensated by increasing the measurement accuracy for galaxies at relatively 
low redshifts. Therefore much less spectroscopic calibration samples at high redshifts are 
needed (e.g.. Ma et al. 2006). Concerning the catastrophic-error- induced bias discussed in 
this paper, however, there is no such a compensation. Thus the required ~ 10^ spectral 
calibrations in Zphot = [3, 4] may be regarded as challenging tasks. 

It is also argued that the inclusion of both U and near-IR filters is essential for the 
accuracy of photo-z and the minimization of the catastrophic outliers (e.g., Dahlen et al. 
2008, Abdalla et al. 2008), as the Lyman and 4000A breaks can be effectively distinguished. 
Given that our simulations with 9-filter set already include the near-IR bands, we consider 
the improvements from U band coverage. Extending the wavelength coverage to 3200A, 
our simulations show that the catastrophic failure fraction decreases dramatically to the 
level around 0.1% (Jouvel et al. 2009, in prep.). In this case, the bias should no longer 
be significant compared to the statistical errors. It is thus important to investigate the 
possibility to include < 3500A sensitivity for the next generation space dark energy mission. 
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It has also been proposed to filter out those problematic galaxies, such as late type 
galaxies, low signal-to-noise galaxies, or very low/high redshift (e.g., z < 0.5/ z > 2.5) 
galaxies whose photo-zs are most suspicious, in the shear analysis to reduce the impact of 
catastrophic failures (Abdalla et al. 2008; Margoniner & Wittman 2008; Heavens, Kitching 
& Taylor 2006). In such cases, the statistical power of a survey is inevitably lowered to 
certain extents depending on the fraction of galaxies discarded. In Figure [9], we show the 
degradation of the constraints if those galaxies at z < 0.5 and z > 2.5 (thus 3 bins left) 
are excluded in the analyses, compared to the case with calibrated (known) /cata- We see 
that the Figure of Merit (denoted as F.o.M)0 decreases from 41 to 24, i.e., almost a factor 
of two degradation. If we re-divide the galaxies with 0.5 < z < 2.5 into 5 finer bins (right 
panel of Figure [9]), the F.o.M = 29, or a degradation factor ~ 1.5 (see also table 1 for a 
summary.). All these calculations are done with our fiducial survey conditions and Gaussian 
priors cr^Qh) = 0.01 for and cr{pi) = 0.05 are applied upon all other parameters. 

In our current analyses, we isolate the island galaxies appearing around (^spec^^phot) = 
(0.4, 3.5) to see clearly their effects on cosmic shear analysis. On the other hand, the scatters 
and small bias of 2;phot around 2;spec have been modeled and studied in, for example. Ma et al. 
(2006). With and Zbias as well as their uncertainties Acr^ and A2;bias taken into account, 
the statistical errors of the constrained cosmological parameters will be degraded depending 
on the values of (cr^, Zbias, Aa^, Azbias)- For instance, considering (a^, 2:bias, Acr^, A2;bias) 
= (0.05(1 + z),0, 0.003, 0.003), the degradation factor for wq and Wa is about 1.5 (Ma et al. 
2006). Then a crude estimate based on our analyses gives the needed Nsp^c at 2; ~ [3, 4] 
to be about 8600/(1.5)^ ~ 4000 to control the bias down to la level for /^ata = 1% and 
/sky = 0.25. More complete analyses should be done for both the degradation and the bias 
simultaneously taking into account both the small and the large photo-z errors. 
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Table 1: Comparisons of F.o.M of wq and Wa- Note, the fiducial F.o.M is given under a 
calibrated (known) /cata- Gaussian priors a{flb) = 0.01 for fth and a{pi) — 0.05 are applied 
upon all other parameters. 



Fiducial 5-bins | With cut and 0.5 < z < 2.5 

I 3-bins left | 5-bins re-defined 

F.o.M 41 24 29 



G < Bed < 100 IB < niB5 < 38 ^1 < z < 6 




Fig. 1. — Result of LePhare simulation for our fiducial survey with a 9- filter set. A magnitude 
cut of 26 AB is set to fit the median redshift of the survey. The 2 grey lines show the borders 
inside which the relation l^^gpec — -^photl < 1 holds. 
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Fig. 2. — Source galaxy distribution n{z). The solid curve is the overall galaxy distribution 
defined in Equation Q. 5 tomographic bins are equally spaced between < 2; < 3, denoted 
by dashed vertical lines. All galaxies with z > 3 are simply added into the last bin. 
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Fig. 3. — A sketch to explain where the bias comes from in cases without and with cahbration. 
The top panel shows the 'observed' photo-z distribution of Bin-5. Bottom-left: the true (spectral) 
distribution of Bin-5. Bottom-right: an example of the calibrated distribution of Bin-5, derived 
from an individual realization of calibration simulations. 
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Fig. 4. — Biases on the dark energy parameters wq and Wa in 5 z-bins tomography, from fitting 
(left panel) and from Fisher matrix calculations (right panel), respectively. The fiducial value 
is denoted by the oblique cross, and the plus represents the 'best fit' value. The error contours 
(1, 2, 3a, inside-out) are given at the best fit value for the fit result and at the fiducial value for 
Fisher matrix calculations, respectively. Gaussian priors a{Q,i)) = 0.01 for Qb and aipi) = 0.05 are 
applied upon all other hidden parameters. 
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Fig. 5. — Biases on wq and Wa from 100 realizations for A'gpcc = 100 (upper panels) and A'gpcc = 500 
(lower panels), respectively. Left panels: the results of full calculations including both the auto- and 
cross- correlations between different redshift bins. Right panels: the results considering only the 
auto-correlations within individual redshift bins. In each panel, the pluses and triangles denote the 
'best fit' values of individual realizations from fitting and from the Fisher matrix calculations, 
respectively. The contours correspond to 1,2,3(7 statistical errors computed at the fiducial values. 
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Fig. 6 — Joint Wq and Wa bias, 6{wo, Wa), and the residual /cata (corresponding to /cata-/cata) 
from different realizations, for A^spec — 500. The dispersion of 5{wo,Wa) at a fixed residual 
/cata reflects the effect of different Zm and a estimated from individual realizations. 
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Fig. 7. — The distribution of joint wq and Wa bias 6{wo,Wa) derived from 500 realizations 
of cahbration sampling with A^spec — 500. The dashed line shows a best-fit Gaussian curve 
to the bias distribution histogram. 
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Fig. 8. — Relation of the rms bias, a[S{wo, Wa)], with the number of spectra at 3 < Zphot < 4 
available for calibration, Ngpec- The dashed lines denote the joint la and 2a statistical errors 
of Wo and Wa, respectively. 




-2.0 -1.5 -1.0 -0.5 -2.0 -1.5 -1.0 -0.5 



w„ w„ 

Fig. 9. — The comparisons of constraints on wq and Wa- The sohd contours in both panels 
are the results for the fiducial 5 bin tomography. The dashed contours are the results for 
discarding those galaxies at 2; < 0.5 and z > 2.5, either with the left 3 bins at z=[0.5, 2.5] 
kept unchanged (in left panel) or with galaxies at z=[0.5, 2.5] re-divided into 5 finer bins 
(in right panel). Gaussian priors a{flh) = 0.01 for flf, and a{pi) = 0.05 are applied upon all 
other hidden parameters. 



